ci/test_matrix.yml: test with PyPI cuda-toolkit 13.3.0 #2140
Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.
0bd613f to 836c875
/ok to test
PR 2140: First CI analysis of 13.3.0 failures
Workflow run inspected:
Summary
The failures are not affecting all CUDA 13.3.0 test lanes. The CUDA 13.3.0
The common failure mode is NVRTC compilation failing because CUDA C++/CCCL
Other failing tests show the same class of missing-header problem:
and on Windows example tests:
Observed CI Pattern
Passing:
Failing:
Representative failed jobs include:
Root Cause
The cuda_cccl
For CUDA 13.3.0, the redistrib metadata no longer uses that key. It uses:
Evidence from redistrib metadata:
The 13.3.0 logs show the helper filtering out the old component name:
After that, the local mini-CTK is assembled without CCCL headers, so tests that
Why Wheel Lanes Pass
The wheel lanes do not rely on this locally assembled mini-CTK in the same way.
This is why the apparent failure is broad across local 13.3.0 tests, but not
Suggested Fix
Update the mini-CTK component resolution to handle the redistrib component
Reasonable approaches:
The first option is cleaner because existing workflow inputs can keep using the
After the fix, rerun a small representative subset first:
If those pass, rerun the full PR CI.
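One way to handle a renamed component key is to map the name a workflow requests onto whichever key the redistrib metadata actually contains. The sketch below only illustrates that idea and is not the code in ci/tools/fetch_ctk_redistrib.py: the function name resolve_component, the COMPONENT_ALIASES table, and the simplified metadata dicts are assumptions made for the example. Only the cuda_cccl and cccl key names come from this PR.

```python
# Hypothetical alias table: requested component name -> keys to try, in order.
# Only the cuda_cccl -> cccl rename is taken from this PR.
COMPONENT_ALIASES = {
    "cuda_cccl": ("cuda_cccl", "cccl"),
}


def resolve_component(requested, metadata):
    """Return the key under which `requested` appears in `metadata`, or None."""
    for candidate in COMPONENT_ALIASES.get(requested, (requested,)):
        if candidate in metadata:
            return candidate
    return None


# Assumed, heavily simplified metadata shapes; real redistrib JSON has more fields.
older_metadata = {"cuda_cccl": {}, "cuda_nvrtc": {}}
ctk_13_3_metadata = {"cccl": {}, "cuda_nvrtc": {}}

print(resolve_component("cuda_cccl", older_metadata))       # cuda_cccl
print(resolve_component("cuda_cccl", ctk_13_3_metadata))    # cccl
print(resolve_component("cuda_missing", ctk_13_3_metadata)) # None
```

Trying the requested name first keeps older toolkit versions on their existing key, and returning None for an unknown component lets the caller decide whether to fail loudly instead of silently assembling a mini-CTK without it.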
/ok to test
Description
Update CI coverage to test against CUDA Toolkit 13.3.0 now that cuda-toolkit 13.3.0 is available on PyPI (posted 2026-05-27 at 07:36 AM PDT).
Similar to the changes under PR #1745 for the CUDA 13.2.0 release, this replaces the CUDA 13.2.1 entries in the pull-request and nightly CI matrices with CUDA 13.3.0, while preserving the existing spread of Python versions, platforms, GPUs, and local-vs-wheel CUDA Toolkit coverage.
While validating the matrix update, the CUDA 13.3.0 local-CTK lanes exposed a redistrib metadata change: CTK 13.3.0 renamed the CCCL component key from cuda_cccl to cccl. The mini-CTK fetch helper now resolves that renamed component so local-CTK jobs still install the CCCL headers required by NVRTC tests.
Changes
Update ci/test-matrix.yml from CUDA 13.2.1 to CUDA 13.3.0 for PR and nightly coverage.
Update ci/tools/fetch_ctk_redistrib.py for the cuda_cccl -> cccl redistrib key rename introduced with CTK 13.3.0.
ci/tools/tests/test_fetch_ctk_redistrib.py.
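Because the failure only surfaced once NVRTC tests tried to compile, a cheap preflight check on the assembled mini-CTK could catch a missing component earlier. The following is a minimal sketch of that idea, not part of this PR: the function name missing_headers is hypothetical, the header paths are examples only, and the directory layout is a temporary stand-in, not the real mini-CTK layout.

```python
import tempfile
from pathlib import Path


def missing_headers(include_dirs, required):
    """Return the headers in `required` not found under any of `include_dirs`."""
    return [
        header
        for header in required
        if not any((Path(d) / header).is_file() for d in include_dirs)
    ]


# Demo against a throwaway directory standing in for a mini-CTK include root.
with tempfile.TemporaryDirectory() as tmp:
    include = Path(tmp) / "include"
    (include / "cuda" / "std").mkdir(parents=True)
    (include / "cuda" / "std" / "cstdint").touch()
    # Example header names only; a real check would list what the tests need.
    demo_result = missing_headers([include], ["cuda/std/cstdint", "cuda/std/utility"])
    print(demo_result)  # ['cuda/std/utility']
```

A CI step could call such a check right after the mini-CTK is assembled and fail with the list of missing headers, which points at the fetch step instead of at an NVRTC compile error later in the run.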